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Abstract 

In this article we provide ne w applications for e xponenti al approx- 



in pressl ). which is 



imation using the framework of iPekoz and Rollin 
based on Stein's method. We give error bounds for the nearly criti- 
cal Galton- Watson process conditioned on non-extinction, and for the 
occupation times of Markov chains; for the latter, in particular, we 
give a new exponential approximation rate for the number of revisits 
to the origin for general two dimensional random walk, also known as 
the Erdos- Taylor theorem. 



1 INTRODUCTION 
A new framework for estimati ng the error of the exponent ial approxima- 



tion was recently developed in Pekoz and Rollin ( in pressl ). where it was 



applied to geometric sums, Markov chain hitting times, and the critical 
Galton- Watson conditioned on non-ext inction. In this a rticle we provide 
some generalizations to the approach of Pekoz and Rollin ( in pressl ) and ap- 



ply th em to study Markov chain occupation times and a result of Erdos and 
Taylor diagj" for the number of visits to the origin by the two dimensional 



random w alk, as well as to get a rate for the result of Fahadv. Quine, and 



Vere-Jones (Il97ll ) for the nearly critical Galton- Watson branching process 



conditioned on non-extinction. 



Pekoz and Rollin ( 


in Dressi) 


z. Ross and Pekoz 


( 


2007) 1 



can be thought of as formalizing the intuitive notion that a random variable 
X has approximately an exponential distribution if X and X e are close in 
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distribution, where X e has the equilibrium distribution with respect to X 
characterized by 



The equilibrium distribution appears in renewal theory as the time until the 
next renewal starting from steady-state. A renewal process with exponen- 
tial inter-renewal times has the exponential distribution for its equilibrium 



distributi on, and so the above intuition is not surprising. iPekoz and Rollin 



m 



press! ) give bounds on the accuracy of the exponential approximation 



in terms of how closely X and X e can be coupled together on the same 
probability space; one version of the result we will use below can be written 

as 

sup|P[X > x] - e~ x/]EX \ < 2.46y / E|X-X e |. 

Some heuristics for Stein's method can be understood using size-biased 
random variables. For a continuous random variable X with probabil- 
ity density function f(x), the size-biased random variable X s has density 
x/(x)/EJ. The size of the renewal interval containing a randomly chosen 
point as well as the children in the family o f a randomly chosen child are 



examples of size-biased random variables; seelBrownl (120061) and Arratia and 



Goldstein (l2010h for surveys and applications of size biasing. 



Stein's method for the exponential distribution, as well as for some other 
nonnegative distributions, can be viewed in terms of size-biasing. For the 
Pois son approximation to some random variable X, the Stein-Chen method 
(see Barbour. Hoist, and Janson ( 19921 )) gives a bound on the error in terms 



of how closely X and X s — 1 can be coupled together on the same probability 
space; these both have exactly the same distribution when X has a Pois- 
son distribution. For approximati on by a binomial distribution fsee Pekoz. 
Rollin, Cekanavicius, and Shwartz (j2009h ). we can obtain a bound in terms 
of how closely X s — 1 and n — (n — X) s can be coupled; both of these have 
exactly the same distribution if X is binomial with parameters n and p. For 
the exponential distribution, we can obtain a bound on the error in terms of 
how closely X and UX S can be coupled, where U is an independent uniform 
(0,1) random variable independent of all else; X e has the same distribution 
as UX S . This last approach is the one we use below for the nearly critical 
Galton Watson process conditioned on non-extinction. 

The organization of this article is as follows. In Section 2 we give the 
notation, background and preliminaries. In Section 3 we consider the setting 
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of a nearly critical Galton Watson branching process conditioned on non- 
extinction. In Section 4 we study general dependent sums, occupation times 
for Markov chains and the the number of times the origin is revisited for the 
two-dimensional general random walk. 

2 PRELIMINARIES 

We first define the probability distance metrics we use below. For two prob- 
ability distributions F and G define the Kolmogorov distance metric as 

d K (F,G) = sup\F(x) -G(x)\. 

If both distributions have finite expectation, define the Wasserstein distance 
metric 

d w (F,G) = f \F(x) - G(x)\dx. 
We can relate the two metrics using 



sec c 



d K (P,Exp(l)) < 1.74 v / d w (P,Exp(l)); 
.g. iGibbs and St3 (|2002h . 



Central to the approach in Pekoz and Rollin ( in press! ) is the equilibrium 



distribution from renewal theory, and we next give the definition we use. 

Definition 2.1. Let X be a non-negative random variable with finite mean. 
We say that a random variable X e has the equilibrium distribution w.r.t. X 
if for all Lipschitz-continuous / 

E/(I)-/(0)=EIE/'(r). (2.1) 

It is straightforward that this implies 

nX e ^x] = ^£p[X>y]dy. 

Indeed for nonnegative X having finite first moment, define the distribution 
function 



F e {x) = ^ j\[X>y\dy 



on x ^ and F e {x) = for x < 0. Then 

-X 



Ef(X) - /(0) = E / f'(s)ds 
Jo 

f'OO f'OG 

= E / f(s)I[X > s]ds = / f'(s)~P[X > s]ds 
Jo Jo 
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so that F e is the distribution function of X e and our definition via (|2.ip is 
consistent with that from renewal theory. 

The size biased distribution will also be used below. We define it as 
follows. 

Definition 2.2. Let X be a non-negative random variable with finite mean. 
We say that a random variable X s has the size-biased distribution w.r.t. X 
if for all bounded / 

E{I/(I)}=EIE/(I S ). (2.2) 

We next present the key result from Pekoz and Rollin ( in press! ) that we 
will use in the applications that follow. 

Theorem 2.1 ( Pekoz and Rollin ( in pressl ) . Theorem 2.1). Let W be a non- 
negative random variable with JEW = 1 and let W e have the equilibrium 
distribution w.r.t. W. Then, for any /3 > 0, 

dK(jSf(W),Exp(l)) < 12(3 + 2P{\W e -W\ > p], 

and, if in addition W has finite second moment, 

dw(jSf(W),Exp(l)) < 21E\W e - W\ (2.3) 



THE NEARLY CRITICAL GALTON- WATSON BRANCHING 

PROCESS 



It was shown by lYagloml (|1947l ) that the limiting distribution of a critical 
Galton- Watson branching process conditioned on non-extinction is expo- 
nenti al. A corresponding rate of convergence was first proved by Pekoz and 
Rollin ( in pressl ). In the super- and sub-critical cases, the limiting distri- 



butions are very diffi cult to cal c ulate a nd are only known e xplicitly in very 
special cases; see e.g. iBingham (|l988h . iFahadv et ail (|l97lh . however, were 
able to show that the limiting distribution of a nearly critical branching pro- 
cess conditioned on non-extinction converges to the exponential distribution 
as m — > 1 over a general class of offspring distributions. The following the- 
orem gives explicit error bounds for the exponential approximation for any 
finite n and any m/1. The bounds obtained confirm the exponential limit 
as m — > 1 for the limiting distribution under the assumptions of a finite 
third moment for the offspring distribution and a non-zero probability of 
having 2 or more offspring. 
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Theorem 3.1. Consider a Galton-Watson branching process starting from 
a single particle at time zero, and let Z n be the size of the nth generation. 
Assume P[Zi ^ 2] > 0, m = EZi / 1 and E|Zi| 3 < oo. Let a = P[Zi = 
l]/F[Zi > 2], 

C = (2 + a)(2 + o + VarZi + E|Z!| 3 ), 
and A = l/E(Z n |Z„ > 0) = P[Z n > 0]/m n . Then 

d w (^{XZ n \Z n > 0),Exp(l)) < Cr](n,m), (3.1) 

where 

1 — m (1 — m) 2 \ m 2 - 7 

nim n) = -I- ? 

n ' ' 2(1 -m n ) m(l - m n ) j-^ 1 - mi' 

In particular, for any n 1 and m > 1, 

V (m, n) ^ 6(m - 1) + 6-5 + ^° s(n) , (3.2) 
and, /or any n ^ 2 and ^ ^ m < 1, 

n(m, n) < (4.5 - log(l - m)) (1 - m) + 4 - 5 + lp g( n ) _ (3 3) 

' n 

Note that as a consequence of (|3.2|) and (|3.3|) we obtain that the limit- 
ing distribution of a nearly critical branching process conditioned on non- 
extinction converges to the exponential as m — > 1 under the very mild 
condition that C does not grow too fast. This is in particular true if the 
third moment of the offspring distribution remains bounded from above and 
P[Z\ J? 2] remains bounded away from zero as m — > 1. 

Proof of T heorem\3.1\ With some mod ifications, we follow the line of argu- 



ment from iPekoz and Rollinl din press!), which is b ased on the size-biased 



branching tree of Lyons. Pemantle. and Peresl ( 19951 ). 



We assume that the particles in the tree are labeled and ordered. That 
is, if w and v are two particles in the same generation, then all offspring of 
w are to the left of the offspring of v, whenever w is to the left of v. We start 
in generation with one particle vq and let it have a size-biased number of 
offspring. Then we pick one of the offspring of vq uniformly at random and 
label it v\. For each of the siblings of v\ we continue with an independent 
Galton-Watson branching process with the original offspring distribution. 
For v\ we proceed as we did for vq, i.e., we give it a size-biased number of 
offspring, pick one uniformly at random, label it V2, and so on. 
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Denote by S n the total number of particles in generation n. Denote by 
L n and R n , respectively, the number of particles to the left (exclusive v n ) and 
to the right (inclusive v n ), respectively, of v n . Denote by S n j the number of 
particles in generation n that stem from any of the siblings of Vj (but not Vj 
itself). Likewise, let L n j and R n ,j, respectively, be the number of particles 
in generation n that stem from the siblings to the left and right, respectively, 
of Vj. We have the relations L n = Y^j=i L n ,j and R n = ^ + Sj=i Rn,j- 

Next let R' n • be independent random variables such that 

•^(R'nj) = ^{Rn,i\Ln,j = 0). 

and, with A n j = {L n j = 0}, define 

RnJ = Rn,jlA nJ + Rn,j J A c n j = R71J + (R'n,j ~ ^n,]) 1 ^- 

Define also R n = 1 + Y^j=i ^nj- Below are a few facts that we will subse- 
quently use to give the proof of the theorem. In what follows, let a 2 = Var Z\ 
and 7 = El-Z^ 3 . 

{%) The size-biased distribution of £?(X) is the same as that of J£(X\X > 
(ii) S n has the size-biased distribution of J£(Z n ); 
(Hi) v n is uniformly distributed among the particles of generation n; 
iiv) J?(R*) = J?(Z n \Z n > 0); (v) E{R' nd I A o n .} ^ m-VP^]; 

(vi) !E{R n jI A c n J ^m n - j 7 P[A c nJ ]; (vii) V[A c n J ^ m" 1 o 2 V[Z n ^ 3 > $\. 

, m n (l — m) 

(viii) P[Z n > < 1 + a — -i -± 

1 — m n 

For (i)-(iv) see Pekoz and RollinI ( in press ). Using independence, 

E{R' ntj I A c nj } = ER' nd P[A c ntj ] < E^PKJ < m n -ia 2 P[A n J, 

which proves (v). If Xj denotes the number of siblings of Vj, having the 
size-biased distribution of Jzf (Z{) minus 1, we have 

nRnjlAo.} ^ m^nXjlA^.} < ^m^kPlXj = k,A nJ ] 

k 

^J2 mn ~ jkP i X j = k M A n,j\ X j = k \ 
k 



k 



hence (vi). Now, 

P[A c nd ] = EjPKJIj]} < miXjViZn-j > 0]} = m-VP[Vj > 0], 



which proves (vii). Finally, using the Corollary on page 356 of iFuiimagari 
dl980h . we have 

P[Z n >0] < (2 + a) _ n 

m n — 1 

which is (otm) (note that the result cited is for bounded offspring distribu- 
tion, but easily extends to the unbounded case). 

Due to (iv) we can set W = Ai?*. Due to (£) and (ii), 5 n has the 
size-biased distribution with respect to i?*. Let C7 be an independent and 
uniform random variable on [0,1]. Now, R n — U is a continuous random 
variable taking values on [0, S n ] and, due to (in), has distribution J?(US n ); 
hence we can set W e = X(R n — U). Therefore, we can apply (|2.3j) and use 
(v)-(vii) to obtain 

E[PF - W e \ < A/2 + AE|i?* - 

n 

< A/2 + A ^{KjlA^ + Rn,jlA^ } 

2 m. 2 .? 



(2 + a)(l-ro) C(l-m) 2 ^ m 



<^ — : — ^ - -| ^ '— > - 

2(1 -m n ) m(l-m n )^l 

which proves (|3.ip . 

Let now m > 1. Note that m 2 - 5 /{rn? — 1) is an increasing function in j, 
except possibly at the beginning where it may be decreasing. Hence 



Em 3 m 2m 2n f m 2x , 

— ; ^ 1 h / dx 

m? — 1 m — 1 m n — 1 /i m x — 1 
j= i J i 



m 2 2m 2 " | m"-m + log(^) 



m — 1 m n — 1 log(m) 
which implies that 

m — 1 m(m — 1) 2m 2n (m — 1Y 
rjim, n) < — H + 



2(m n — 1) m n — 1 m(m n — l) 2 



^ ( m -l)2(m"- m ) (m - l) 2 Iog(^ 



m(m n — 1) log(m) m(m n — 1) log(m) 
=: ri + r 2 + r 3 + r 4 + r 5 . 
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Recall that 

n-l 

m 



n i 

= V m k ^ (3.4) 

l- 1 f-' 

This implies that 



fe=o 



1 m m — 1 1 1 

ri ^ — , r 2 < — = 1 — < m — H — . 

In n n n n 



and, using that (a + 6) 2 < 2a 2 + 26 2 



2 /m n (m — 1) \ 2 2/ m — 1 \ 2 

r3 = -{ ~ T 1 ) =- m-l + 



m \ m n — 1 / m V m 

4(m - l) 2 4 , ,4 
^ + 5 < 4 m - 1 + -. 

m mr re 

Furthermore, recalling that (m — 1) ^ rrelog(rre), 



(m — 1) (m n — m) 

r 4 ^ ^ '- ^ m - 1. 

m n — 1 



Finally, 



(m-l) 2 log(^) m-1 1 log(n) 

r s = -t-t: 7ti— 7-^ < r log — r < - + 



m(m n — 1) log(m) rre n — 1 V m — 1 / n re 

The last estimate is due to the fact that \og{x)/x is clearly bounded by 
(l + log(x))/x for x > 1, and the latter is a decreasing function, and then by 
applying ()3.4p . Putting the estimates for n through rs together proves ()3.2|) . 
Let now < m < 1. As m 23 '/(l — m J ) is a decreasing function in j (to 

2 

see this, note that is increasing on < x < 1) we have 



m 2j m 2 m 2n rn m 2x 
r < 1 h / < 

— m> 1 — m 1 — m n Ji 1 — m x 



m 2 m 2n m-m n + \og{^) 

1 — m 1 — m n log(m) 

which implies that 

1 — m mil — m) 2m 2n (l — m) 2 

n(m, n) sC — H H — 

/v ' 1 2(1 - m n ) 1 - m n m(l - m n ) 2 

(l-m) 2 (m-m n ) (l- m ) 2 log(^) 

m(l — m n ) log(rre) mil — m n ) log(m) 

= : ri + r 2 + r 3 + r 4 + r 5 . 
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Note that = ]P£=d mk > m " n > which y ields 

1 — m 1 m n 1 



Hence 
and 



1 — m n ~~~ m n n 1 — m n (1 — m)n 

1 1 1-m 1 

<l + 7 r~ <l-m+-. 

1 — m n (1 — m)n 1 — m" n 



. 1.5 

t\ +^2 ^ 1.5(1 — m) H 

n 



r 3 = m 2 ™" 1 (^^r) 2 < 2(1 - mf + < 2(1 - m) + -. 

V 1 — m n / rr n 



(3.5) 



It is easy to see that ^ 0. 

Using again that m — 1 ^ mlog(m), 



r 5 = 7X\ T^T < - 



m(l — m n ) log(m) 



1— m.' 1 
1— m 



Under the restriction i ^ m < 1 and n ^ 2, we can now apply Lemma [3] 
below for a = -jzSr> & = ana - c = n due to (|3.5p and we obtain 

. l + MA) , l + log(n) 

n 

1— m 

Putting all the estimates together proves (|3.3p . □ 
Lemma 3.2. Zei a, b and c be real numbers bigger than 1 such that 

- < t + - < 1. (3.6) 
a b c 

Then 

log(a) < 1 + log(b) , 1 + log(c) 
a b c 

Proof. It is clear from the monotonicity of the logarithm function that for 
x, y > we have 

xlog(x) + ylog(y) < (x + y) log (a; + y). 

Hence, 

(sc + y)(l - log (a; + y)) < x(l - log(x)) + y(l - log(y)). 
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Rewriting this inequality for x = 1/b and y = 1/c, we have 



1 + log(lfc) < l + log(ft) l + log(c) 



be 
b+c 

Noting that 1+log ( a ) is a decreasing function for a 1 and noting that 

be 



a > ^ > 1 from 



log(a) < l + log(a) < l + log(fe) 



be ' 
b+c 

which proves the claim. □ 

4 VISITS TO THE ORIGIN FOR A TWO DIMENSIONAL SIMPLE 

RANDOM WALK 

Exponential approximation results for sums of nonnegative random vari- 
ables Xi,X2, ■ ■ ■ ,X n s atisfying the cond it ion VaxC E(Xi\X-\ , . . . AQ_i)) = 



for all i were given in iPekoz and Rollinl (jin press! . Theorem 3.1), but not 



for more general dependent sums. Here we give an extension to sums of 
arbitrarily dependent nonnegative random variables, apply it to occupation 
times for Markov chains and then illustrate it by getting a new exponen- 
tial approximation rate for the number of times a general two-dimensional 
random walk revisits the origin. 

Theorem 4.1. Let W = A^"=i Xi where X±,X2, . . . ,X n are (possibly de- 
pendent) nonnegative random variables and let A = l/E^^X,. Suppose 
Wi(x) is independent of all else and 



&{W i (x))=#(\Y i X m X i = x\ 

^ 771=1 ' 



Let L be independent of all else with P[J = i] = AEAj, let U be a uniform 
random variable on (0, 1), and let Xf be independent of all else with the 
size-biased distribution with respect to Xi. Then 

W e = W^XD + XUXf 

has the equilibrium distribution with respect to W . 
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Proof. Let S m = A^™ 1 Xj. By first conditioning on I and U, and using 
and jgf(Si) = JSf(Wi(Xi)) in the third line we get 

n „i 

E/'(W e ) = A VEX, / E{/'(Wi(X?) + A«X?)}du 
~1 Jo 

f(W i (Xt) + \X?)-f(W i (X?)) 



1=1 



i=i 

n 



Xf 



£>{/(#) - /(£_!)} = E/(W) - /(0). 



□ 



Remark 4.1. The argument goes through in the same way when instead 
we define 



JSf(Wi(x))=JSf(A £ X, 



m=j+l 



Xi = X 



We next apply the above result to Markov chain occupation times. Our 
next result gives a bound on the error of the exponential approximation for 
the number of times a Markov chain revisits its starting state. More general 
asympt otic results of this type, but without explicit bounds on the error, go 
back to iDarling and Kad (jl957l ) . 

Corollary 4.2. Consider a Markov chain started at time zero in a state 
and let Xi be the indicator for the event that the Markov chain is in state 
at time i. Let W m = XY^Li^i and A = l/E[^iLi X]. Then writing 
W = W n we have 

n n 
i=l j=n—i+l 

Proof. Using the notation of the previous Theorem 14.11 and Remark 14.11 in 
this setting the strong Markov property gives 

JZ>(Wi)=3?(x X m X i = l) =3?(xJ2Xm) 



and so we can let 



n-I 



r = A^i, + xu, 



i=l 
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where P[J = i] = AEJQ, and conditioning on / gives 



n n 



E|W- W e \ < A + A 2 J^ ^ EXjEX,- 

i=l j=n— i+l 

and then (|2.3p gives the result. □ 
We next consider a general random walk on th e two dimensional i nteger 



lattice started at the origin. As a consequence of lLawler and Limid (12010 . 
p. 24) we have the following lemma. 

Lemma 4.3. Let Z n be an irreducible and aperiodic random walk on H? 
with mean zero and finite third moment. Then there are positive constants 
c\ and C2 such that 



^ < P[Z n = 0] < ^ 
n n 



for sufficiently large n. 



We are now able to give a bound on the error of the exponential ap 
proximation for the number of times the random walk r evisits the origin. 



This t ype of result, for simple random walk, goes back to lErdos and Taylor 
(|l960h . 



Corollary 4.4. Let Z n be an irreducible and aperiodic random walk on H? 
with mean zero and finite third moment. Let R be the number of return 
visits to the origin by time n, and let W = \R, where A = 1/Ei?. Then, 
there is constant C independent of n such that 

dw(-ZW,~S?(Exp(l)) ^ 



log n 
for all n. 

Proof. Let X n = l{z n =o} be the indicator for the event that the random 
walk revisits the origin at time n. Lemma 14.31 gives A ^ C/logn and thus 
the result follows from Corollary 14.21 and, where C may be different (but 
independent of n) in each instance used, 

\ 2 Y y Ex i Ex,< 7r -^v— -L_<-^_. 

^— ' ^— ' log n r i (n — i) logn 

i=l j=n-i+l v & ' i=l K ' & 

□ 
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Remark 4.2. The result for the two-dimensional simple random walk 



sup \P[W > x] 

a<x<b 



C log log n 
log n 



for fixed a and b follows from lErdos and Taylor (jl96Cl Eq. (3.10)), so the 
above corollary can be viewed as a comple ment and extension. Using the 
method of moments. iGartner and Sunl (|2009l . Theorem 1.1) give an argument 
for the analogous exponential limit theorem for general random walks, but 
without a rate of convergence. 
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